Prevalence of X-ray variability in the Chandra Deep Field South 
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ABSTRACT 

We studied the X-ray variability of sources detected in the Chandra Deep Field South (Giac- 
coni et al. 2002), nearly all of which are low to moderate z AGN (Tozzi et al. 2001). We find that 
45% of the sources with > 100 counts exhibit significant variability on timescales ranging from 
a day up to a year. The fraction of sources found to be variable increases with observed flux, 
suggesting that > 90% of all AGNs possess intrinsic variability. We also find that the fraction 
of variable sources appears to decrease with increasing intrinsic absorption; a lack of variability 
in hard, absorbed AGNs could be due to an increased contribution of reflected X-rays to the 
total flux. We do not detect significant spectral variability (Ar > 0.2) in the majority (~ 70%) 
of our sources. In half of the remaining 30%, the hardness ratio is anti-correlated with flux, 
mimicking the high/soft-low/hard states of galactic sources. The X-ray variability appears anti- 
correlated with the luminosity of the sources, in agreement with previous studies. High redshift 
sources, however, have larger variability amplitudes than expected from extrapolations of their 
low-z counterparts, suggesting a possible evolution in the accretion rate and/or size of the X-ray 
emitting region. Finally, we discuss some effects that may produce the observed decrease in the 
fraction of variable sources from z = 0.5 out to z = 2. 

Subject headings: galaxies: active — galaxies: nuclei — X-rays: galaxies — galaxies: evolution 



1. Introduction 

Active Galactic Nuclei (AGN) are found to vary 
on timescales ranging from minutes to years. Vari- 
ability studies have proven to be an important 
tool to investigate the innermost regions of AGNs, 
which cannot be resolved with the current gener- 
ation of astronomical instruments. The timescale 
of the variability, its spectral dependence and the 
correlation between different line and continuum 
components provide fundamental clues on the na- 
ture of the physical processes which occur near 
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to the central Black Hole (BH) and on the size 
and relation between the different regions produc- 
ing the observed emission (see Ulrich, Maraschi & 
Urry 1997, for a comprehensive review). 

In the X-ray band, AGNs show faster variability 
than in any other band, consistent with the X-ray 
emission originating from a small region very close 
to the central BH (sec Mushotzky, Done & Pounds 
1993). Several promising models have been pro- 
posed to explain the fast intrinsic variability and 
its characteristics: a single coherent oscillator (e.g. 
Almaini et al. 2000), variable decay shot-noise due 
to a superposition of individual flares (e.g. Lehto 
1989), and bright rotating spots spiraling around 
the BH (Abramowicz et al. 1991). All these mod- 
els predict a dependence on the BH mass, accre- 
tion rate and the size of the X-ray emitting region 
which may in turn explain the observed depen- 
dence of variability on the AGN luminosity (e.g. 
Lawrence & Papadakis 1993; Green, McHardy & 
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Lehto 1993; Nandra et al. 1997) and spectral prop- 
erties (Nandra et al. 1997; Turner et al. 1999; Fiore 
et al. 1998). 

Most variability studies in the X-ray band are 
based on samples of bright and nearby AGNs for 
which good quality data are available from past X- 
ray missions. Moreover they usually sample short 
timescales from hours to days. Only for the near- 
est sources have long monitoring campaigns al- 
lowed a detailed study of long-term variability up 
to several years. The Chandra Deep Field South 
(Giacconi et al. 2001, hereafter CDFS) represents 
one of the deepest observations of the Universe 
in the X-ray band (the only deeper sample is in 
the Northern Chandra Deep Field). Rosati et al. 
(2002) detected 346 sources in the CDFS, resolv- 
ing > 90% of the hard X-ray background. The 
X-ray luminosities and spectral properties of the 
sources revealed that ~ 80% of them are AGNs 
(Giacconi et al. 2001; Tozzi et al. 2001). Optical 
follow-up studies showed that they are equally di- 
vided among moderate redshift (z < 1.6) Type II 
AGNs with large intrinsic absorption (log Nh > 
21.5 cm -2 ), and unabsorbed Type I AGNs cover- 
ing a large range of redshifts up to z ~ 3.6, both 
hosted by a broad range of galaxy types (Schreier 
et al. 2001; Koekemoer et al. 2002; Rosati et al. 
2002). The remaining 20% of the sample is com- 
posed of a mixture of normal galaxies, starburst 
galaxies, galaxy clusters (Giacconi et al. 2001; 
Koekemoer et al. 2002) and individual off-nuclear 
X-ray sources (Hornschemeier et al. 2004). 

Being composed of several concentric exposures 
obtained over a period of ~ 1 year, the CDFS rep- 
resent an ideal dataset to study the X-ray vari- 
ability of faint and distant AGNs, on timescales 
ranging from a day up to several months. In the 
current study we measure the variability proper- 
ties of the AGN sample and correlate the vari- 
ability with other X-ray properties (flux, luminos- 
ity, hardness ratio, etc.), compare the properties of 
these moderate- z AGN to those of the well studied 
nearby AGN population and study their evolution 
with look-back time. The paper is organized as 
follows: in § 2 we describe the dataset, in § 3 we 
illustrate the variability analysis, in § 4 and 5 we 
present and discuss our results and in § 6 we draw 
the conclusions. 

Throughout the paper we assume a cosmologi- 
cal model with H = 70 km s -1 Mpc -1 , Q m = 0.3 



and Q A = 0.7. 

2. The data 

The CDFS (Giacconi et al. 2001) was observed 
11 times by the Chandra X-ray Observatory from 
October 1999 to December 2000. The diary of the 
observations and their properties are described in 
detail in Giacconi et al. (2002), Tozzi et al. (2001), 
Giacconi et al. (2001); we summarize in Table 
1 the properties of each exposure that are rele- 
vant to this paper. Each observation was filtered 
to include only standard ASCA event grades, hot 
pixels and columns were removed and high back- 
ground intervals were subtracted; all exposures 
were then added to produce a "stacked observa- 
tion" of 942 ks (see Giacconi et al. 2002, for de- 
tails). Exposure maps were produced for each in- 
dividual observation in both the soft (0.5-2.0 keV) 
and hard (2.0-7.0 keV) bands, and then combined 
using the appropriate shift and weighting for in- 
dividual exposure times. Throughout this paper 
we used the hard band exposure maps; we will 
discuss the uncertainties introduced by this choice 
in §3.3. The stacked observation was used to de- 
tect X-ray sources both through a modified version 
of the SExtractor algorithm (Bertin & Arnouts 
1996) and the WAVDETECT (Dobrzycki et al. 
1999; Freeman et al. 2001) algorithm included in 
the CIAO 1 software. The final catalogues, includ- 
ing 346 sources, are presented in Giacconi et al. 
(2002). 

3. Data analysis 

3.1. Lightcurve extraction 

To study the temporal variability of sources de- 
tected in the CDFS we extracted counts from both 
the 11 individual exposures and the stacked ob- 
servation for each source in the Giacconi et al. 
(2002) catalogue, in the 0.5-7.0 keV band. We 
adopted the 95% encircled-energy extraction radii 
Rs used by Giacconi et al. (2002), which take into 
account the broadening of the PSF full width half 
maximum (FWHM) with off-axis angle. Similarly 
the local background was measured in an annulus 
with inner and outer radii of Rs + 2 and Rs + 12 
pixels respectively, after masking out overlapping 

1 http://cxc. harvard.edu/ciao 
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sources. The source counts were then background- 
subtracted and exposure corrected to match the 
observing condition at the ACIS-I aimpoint. Er- 
rors were computed using the Gehrels approxima- 
tion cr rj 1 + Vn + 0.75 (Gehrels 1986), which pro- 
vides a more reliable estimate in the low count 
regime. 

Even though the observations which compose 
the CDFS have similar aimpoints, they were de- 
signed to have small shifts between exposures, to 
allow a more uniform coverage of the gaps between 
the ACIS CCDs; furthermore they have different 
roll angles. For this reasons (and because of the 
square shape of the ACIS-I detector) sources near 
the edge of the CDFS field of view (FOV) may 
fall inside the CCDs in some observations, while 
lying outside the detector or partially on the edge 
in others. In order to maximize the number of 
data bins for each source, we decided to retain 
those bins corresponding to observations in which: 
1) less than 10% of the source area and 2) less 
than 50% of the background region fell outside the 
FOV. These cases however are a minority (15% of 
all sources) and we verified that they do not affect 
our analysis, as discussed in § 3.3. We excluded 
from the subsequent analysis 6 sources which had 
only one acceptable observation. An example of a 
typical lightcurve obtained in this way is shown in 
Figure 1. 

3.2. Variability estimates 

A commonly used quantity to estimate variabil- 
ity in literature is the lightcurve \ 2 defined as: 

X ~ N obs - 1 2-, a f w 

where Nobs is the number of observations in which 
the sources falls inside the FOV, n, and m are the 
number of counts and its error measured in the 
i th observation after background subtraction and 
exposure correction, and n is the average number 
of counts extracted from the stacked observation. 

A large part of the sources detected in the 
CDFS, however, have less than 100 counts, which 
implies on average < 10 counts/observation. It 
is well known that, in such low counts regime, 
the Poissonian distribution of X-ray events signif- 
icantly differs from a Gaussian distribution. This 
difference is further enhanced by our choice to use 



the Gehrels error formula discussed above. The 
final counts (background subtracted and exposure 
corrected), obtained through the standard error 
propagation, are not normally distributed any- 
more so that the probability estimates derived 
from the x 2 distribution in such approximation 
are not accurate (a detailed discussion of this is- 
sue can be found in the IRAF/PROS Xray pack- 
age 2 , also see Nousek & Shue 1989; Wheaton et al. 
1995). To properly deal with the small number of 
counts, we performed 1000 Monte-Carlo simulated 
observations of each source, mimicking the exact 
conditions of each exposure. The simulated source 
is composed of the superposition of source and 
background counts extracted from a Poisson dis- 
tribution with the same median as the real source, 
in the assumption that the flux is non-variable. 
The details of the simulations are described in Ap- 
pendix A. The simulated counts were background 
subtracted, exposure corrected and divided by the 
exposure times exactly as done for the real sources. 
Using the same Gehrels error approximation used 
for real observations, we thus derived 1000 sim- 
ulated x 2 f° r eacn source. In Figure 2 we com- 
pare the x 2 distribution of the CDFS sources with 
those obtained from our simulation and with the 
one expected in the Gaussian approximation (in 
which case each source \ 2 would follow a proba- 
bility distribution given by an incomplete gamma 
function r(j//2,x 2 /2) where v are the degrees of 
freedom; see Press et al. 1992). The actual \ 2 dis- 
tribution has a lower median than the Gaussian 
prediction. This is expected since the Gehrels er- 
ror approximation is properly an upper limit and 
always larger than the Gaussian estimate \fn, re- 
sulting in lower \ 2 ■ O n the other end the sim- 
ulated distribution agrees well with the observed 
one, except for \ 2 > 20 where the presence of 
variable sources (not contemplated in the simu- 
lations) produces an extended tail; note that the 
larger peak value in the simulated distribution is 
due to the fact that the histograms are normalized 
to the total number of sources. 

The variability of the sources was estimated 
comparing the \ 2 measured from the lightcurve 
to those obtained from the simulated observations. 
We derived two variability estimates: in the first 
one the comparison is made between the total % 2 , 

2 http: / /iraf . noao.edu/ scripts / irafhclp?cxplain_errors 
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i.e. summing the deviations in all bins as in ex- 
pression (1). The second estimate is obtained us- 
ing just the bin in which we observe the maximum 
deviation from the mean, i.e. using the quantity 

XL,= . max j 1 ^-^) (2) 

While the first estimate provides an average esti- 
mate over the whole lightcurve, and thus is bet- 
ter suited for sources which have small and re- 
peated fluctuations, the second one allows to de- 
tect variability in sources exhibiting rare bursts, 
which may be undetected when we average over 
the whole lightcurve. We classify as variable those 
sources whose probability to obtain a x 2 smaller 
than the measured one, in the assumption of a con- 
stant count rate, is P(x 2 ) > 95% for the averaged 
estimate or P m ax(x 2 ) > 99% in the maximum de- 
viation estimate. A higher limit was adopted for 
Pmaxix 2 ) since its distribution is skewed toward 
larger values by definition; furthermore P m ax{x 2 ) 
may be more affected by deviations occurring in 
a single observation than P(x 2 ), which is a more 
robust estimate since it is averaged on the whole 
lightcurve. This choice however does not signifi- 
cantly affect our results: the use of P(x 2 ) > 99% 
would change the variable/non- variable classifica- 
tion of only 4 sources (2 if we consider objects with 
> 100 counts, as done in most of our analysis). 

To obtain a quantitative measure of the vari- 
ability amplitude we calculated the "excess vari- 
ance" of the CDFS sources (Nandra et al. 1997; 
Turner ct al. 1999): 

1 N obs 

Vrrns = M _ 2 Y. [("i ~ n) 2 - <Tj ] (3) 

Nob s n 

ffrms measures the fraction of the total flux per 
bin which is variable, after subtracting the statis- 
tical error. We also calculated a "maximum excess 
variance" for each source, defined as: 

The accuracy of a rms as a variability estimator 
depends on the S/N ratio of the sources, as shown 
in Figure 3. Faint sources with < 100 net counts 
lie mostly below the 95% upper limit derived from 
simulations (continuous line) indicating that sta- 
tistical uncertainties dominate the excess variance 



estimates, thus making a rms unreliable. At higher 
count levels our sensitivity improves and an in- 
creasingly larger fraction of the sources is detected 
as variable. In the high statistics regime a rms is 
well correlated with the variability estimates ob- 
tained from the x 2 analysis, in the sense that all 
sources with a rms above the 95% limit are found 
to be variable. Below 100 counts such correla- 
tion docs not hold anymore: this is due to the 
fact that while the x 2 analysis correctly estimates 
the probability that a source is variable through 
comparison with the simulations, the correction 
for the statistical error in expression (3) is based 
on a Gaussian approximation which fails in the 
low counts regime. Also note that the 95% limit 
shown in Figure 3 is an average value: the actual 
limit for each individual source depends on the ob- 
serving conditions (exposure time, position within 
the FOV, PSF etc.) which may vary, thus explain- 
ing why a few variable sources with > 100 counts 
are scattered below this limit. 

Almaini et al. (2000) point out that expression 
(3) represents the best variability estimator only 
for identical measurements errors [<Ji ^constant) 
and otherwise a numerical approach should be 
used. However, in our case, their method is not 
strictly correct either because our measurements 
are not normally distributed. We thus decided to 
adopt the simpler analytical approach; the errors 
in our variability estimates are dominated anyway 
by statistical uncertainties due to the low count 
rate, as discussed in next section. 

3.3. Sources of uncertainty 

It is possible for the intrinsic variability present 
in our sources to be masked or enhanced by sta- 
tistical errors or systematic effects. In Appendix 
B we discuss the principal sources of systematic 
uncertainty which may affect CDFS sources: QE 
degradation, exposure corrections, PSF variations, 
corrections for sources on edges. We find that 
these effects influence our count rate estimates 
by less than 10%. This implies that the main 
source of uncertainty in the majority of our sources 
is the statistical error. In fact, statistical errors 
<7i ~ *Jnl drop below 10% only for sources with 
rii > 100 counts per observation on average, i.e. 
ntot > 1100 total counts. The CDFS contains just 
11 such sources and all of them exhibit much larger 
flux variations. Note however that, as pointed out 
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in the previous section, we can draw significant 
conclusion for all sources with > 100 total counts. 
In the discussion below we consider the effect of 
these uncertainties on our results. 

4. Results 

4.1. Variable fraction 

Through the x 2 analysis we are able to detect 
variability in 74 out of 340 sources, i.e. <~ 20% of 
the sample, in the 0.5-7 keV band. However our 
ability to detect flux variations depends on the 
S/N ratio of the sources: for sources with < 100 
counts the observed flux variations are dominated 
by statistical fluctuations (§ 3.2). In the follow- 
ing analysis we therefore concentrate on sources 
with > 100 total counts, where the statistical un- 
certainties drop below a rms ~ 50% (continuous 
line in Figure 3). The faintest flux variations 
that we detect in these sources corresponds to 

Crms ~ 10%. 

To better restrict our variability analysis to the 
AGN population, we can also exclude from the 
sample sources which are not a-priori expected to 
possess any temporal variability. Giacconi et al. 
(2002) found that 19 of the CDFS sources were 
spatially extended and thus arc unlikely to be vari- 
able. Furthermore at low luminosities (Lx < 10 42 
erg s _1 ) the X-ray population is believed to con- 
tain a large fraction of starburst and/or normal 
galaxies (e.g. Alexander et al. 2002). We de- 
rived intrinsic X-ray luminosities in the 0.5-7 keV 
band using the photometric redshifts calculated 
by Mobasher et al. (2003) using Hubble-ACS and 
ground-based optical and NIR data acquired as 
part of the GOODS collaboration (Giavalisco et 
al. 2003). Luminosities were k-corrected assuming 
a power law spectrum with T = 1.8 3 (Alexander 
et al. 2003). 

The dependence of the fraction of variable 
sources on the total counts is shown in Figure 
4. The fraction increases from 45% considering 
sources with > 100 counts, to 94% for sources with 
> 800 counts. Excluding extended sources and 
those with Lx < 10 42 erg s _1 (dashed line) the 
result does not change substantially, confirming 



The use of more complex models including an average ab- 
sorption to compute the k-correction does not significantly 
change the results. 



that AGNs dominate our sample. This increase in 
the variable fraction with counts indicates that, as 
we become more sensitive to detecting variability, 
we systematically find a larger number of vari- 
able sources. Even though the limited number of 
objects requires caution in extrapolating the vari- 
able fraction to higher counts, Figure 3 further 
suggests that all sources with > 1000 counts may 
be variable. 

4.2. Variability dependence on hardness 
ratio 

The increase in the fraction of variable objects 
with total flux, discussed in the previous section, 
is obviously partly due to the increased S/N ratio 
of the sources. A second effect, however, seems to 
be present. Comparing the hardness ratio HR=H- 
S/H+S, where H and S are the counts measured 
from the stacked observation in the soft (0.5-2 
keV) and hard (2-7 keV) bands, with the net 
source counts (Figure 5), we see that the average 
HR decreases as the sources grow stronger. Fur- 
thermore the majority of variable objects are soft 
(HR< —0.2). Thus, choosing higher flux sources 
systematically selects a larger fraction of the soft 
population, which appears to contain the majority 
of variable sources. 

The counts vs HR distribution can be explained 
by the presence of absorbing material near the cen- 
tral AGN. In Figure 5 we plot the tracks followed 
by a source at given redshift as we increase the 
intrinsic absorption. We assumed a source with 
power law spectrum of T = 1.8 normalized to 1000 
counts (fx — 10~ 14 erg s^ 1 cm~ 2 ) at z = and a 
galactic column density of Njj — 8 x 10 19 cm~ 2 . 
The effect of adding obscuring material is to make 
the source fainter and harder, to an extent which 
depends on the redshift of the source. The plot 
shows that the bulk of the variable population is 
characterized by lower column densities than the 
non- variable one. 

Considering the subsample with available pho- 
tometric redshifts the connection between vari- 
ability and absorption is even clearer. In Fig- 
ure 6 we plot the HR vs redshift of sources with 
> 100 counts, excluding the extended sources and 
those with L x < 10 x 10 42 erg s" 1 (see § 4.1). 
Variable sources appear systematically softer than 
non- variable ones at all redshifts. Moreover, com- 
paring the HR of the sources with the one ex- 
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pected assuming an intrinsic power law spectrum 
(r = 1.8) and increasing levels of photoelectric ab- 
sorption, we find that 2/3 of variable sources have 
Nh < 10 22 - 5 cm~ 2 . The opposite effect holds for 
non- variable sources. 

We tried different approaches to determine if 
the lower fraction of variable sources in the ab- 
sorbed (log Nh > 22.5 cm~ 2 ) subsample is an 
intrinsic feature of this population or is a selec- 
tion effect due to the fact that such sources are 
also fainter than the corresponding unabsorbed 
(log Nh < 22.5 cm~ 2 ) counterparts and thus less 
likely to be detected as variable. 

First, to measure the effect that a lower count- 
rate has on the fraction of variable sources that 
we detect, we artificially rescaled the counts mea- 
sured in each exposure for the unabsorbed popula- 
tion, so that their median counts match the me- 
dian counts of the absorbed subsample. We note 
that a simple rescaling by a constant factor of the 
measured counts would not change the S/N ratio 
of the sources, and thus would not correctly sim- 
ulate a fainter population. In order to correctly 
model the count distribution, we re-extracted the 
source counts from a Poisson distribution whose 
mean is the rescaled value. We then re-estimate 
their variability through the procedure outlined in 
§ 3.2. This approach preserves the intrinsic vari- 
ability of the sources while simulating the effects 
of a lower statistic. In the assumption that ab- 
sorbed and unabsorbed sources have the same in- 
trinsic variability and that the observed differences 
are only due to a different average S/N ratio, we 
would expect the rescaled unabsorbed population 
to have the same variable fraction as the absorbed 
one. We find a variable fraction of 45% in the 
rescaled unabsorbed population. Assuming that 
this is the fraction of variable sources expected in 
the absorbed population due only to the lower S/N 
ratio, we calculate that the probability of finding 
9 variable sources out of 29 with > 100 counts, as 
observed in the absorbed population, is < 5%. 

Second, we compared the excess variance distri- 
bution of absorbed and unabsorbed sources. Being 
corrected for statistical errors, the excess variance 
is less affected by the count-rate bias discussed be- 
fore. Furthermore, since <r 2 ms represents the frac- 
tional variation of the total flux, it is not expected 
to change due only to photoelectric absorption be- 
cause both the average flux and its fluctuations 



are attenuated by the same factor. In Figure 7 
we show the cumulative c 2 ms distributions for the 
absorbed and unabsorbed subsamples. The null hy- 
pothesis that the two subsamples are extracted 
from the same distribution is rejected at the 96% 
level. 

While none of these tests is conclusive, they 
suggest that the lower variability observed in the 
hard absorbed population is an intrinsic feature 
and is not due to selection effects. 

4.3. Characterizing the variability 

4-3.1. Total flux variability 

The lightcurves of our X-ray sources are very 
heterogeneous: we find some sources which ex- 
hibit smooth flux declines and increases and oth- 
ers with completely random behaviors 4 . The ex- 
cess variance cr 2 ms and the maximum excess vari- 
ance cr„ aa . measure the (squared) fraction of the 
total flux which is variable in the observed band- 
pass, and these are useful quantities to character- 
ize the source variability. In Figure 8 we show the 
distribution of cr 2 ms and cr 2 aaa . measured in the 
CDFS sources. The variable subsample spans a 
large range of er 2 ms , ranging from a few % up to 
~ 95% of the total emission, with a median value 
of a rms ~ 30%. The o~^ nax distribution further 
shows that half of the variable subsample exhibits 
maximum flux variations > 90%, with peaks up 
to > 230% [a 2 max = 5.3). The distribution of both 
<7 2 ms and (7^ m for variable sources represent the 
upper tail of the intrinsic distribution: there is in 
fact a large fraction of sources with low variable 
fluxes which can not be detected as variable with 
the present data due to the large statistical errors. 

4-3.2. Spectral variability 

We studied the spectral variability of the 
sources by comparing the average HR of each 
source to the one measured in the individual ex- 
posures. Limiting the analysis to the 27 variable 
sources with > 500 counts, where the statistics is 
high enough to allow an accurate HR determina- 
tion in each bin, we find that only 4 sources exhibit 
spectral variability significant at the 2a level (Fig- 

4 Note that this is true also for sources with good statistics, 
where the random behavior cannot be attributed to statis- 
tical uncertainties. 
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lire 9). In 3 of them the variability is detected in 
just one exposure. We further looked for a corre- 
lation between total counts and HR, finding that 5 
objects show significant correlation according to a 
rank correlation test: for all of them but one, the 
HR is anticorrelated with the total counts, mean- 
ing that as the source grows stronger, its spectrum 
gets softer (Figure 9, right panel). We point out 
that our sensitivity to HR changes is limited by 
the small statistics and varies depending on the 
total counts of the source. The average error on 
the HR in a single observation goes from ~ 0.25 
at n = 100 counts to 0.1 at n = 1000, making 
power-law slope variations Ar < 0.2 undetectable 
with the present data. 

We conclude that ~ 30% of the sources with 
> 500 counts show some sign of spectral variabil- 
ity, with spectral changes Ar > 0.2. Furthermore, 
while we observe the presence of a softening of 
the spectrum with flux strength in <~ 15% of the 
sources, we find no convincing evidence of the exis- 
tence of objects which show a spectral hardening. 
Note that we find no significant correlation with 
redshift, indicating that the lack of spectral vari- 
ability in the majority of the sources is not due to 
the harder band sampled at higher z. 

4-3.3. Variability timescales 

To study the variability timescales of our 
sources we compared the difference An = n, — nj 
in the net counts between each pair of bins i, j 
where i <E {0, • • • , 10} and j > i, with the one 
predicted by the simulations for a source with 
constant flux. We assumed that a source is signif- 
icantly variable on a given timcscalc At = \ti — tj\ 
if the probability of observing a smaller An 
is > 95%. The distribution of the measured 
timescales is shown in Figure 10 (left panel). Each 
bin has been normalized to the number of times 
the corresponding timescale has been sampled in 
all sources to account for the irregular distribu- 
tion of the X-ray observations and the missing 
data in the lightcurves of sources falling near the 
edge of the FOV. The gaps in the histogram rep- 
resent timescales which have not been sampled 
by our data. In practice the histogram represents 
the probability of detecting variability on a given 
timescale At. 

In the right panel we show the distribution of 
timescales At rest = At ■ (1 + z)^ 1 in the source 



rest-frame (i.e. correcting for time dilation) for 
the subsample with available photometric redshift. 
Due to the redshift distribution of the sources, this 
time we are able to sample all timescales ranging 
from less than one day up to a year, almost with- 
out any gap. We find evidence of variability on 
all timescales; however the figure shows that long- 
term variability prevails and that it is <~ 5 times 
more likely to observe variability on timescales 
> 100 days than on less than a day. We stress that 
this difference is not due to a different sampling of 
short and long timescales since the histograms are 
normalized, nor to the different S/N ratio of the 
individual observations, since the average length 
of the December 2000 exposures (used to sample 
the short term variability; see Table 1) is compa- 
rable to the average of all exposures. 

In Figure 11 we present the distribution of 
the minimum rest-frame timescale observed in 
each lightcurve. The plot shows that 70% of the 
variable sources possess variability on timescales 
smaller than 2 days. This means that even though 
long-term variability is dominant in our sample, 
in the sense of observing a source at two different 
times and finding a higher probability of detecting 
variability on long timescales than on short ones, 
short-term variability is still found in the majority 
of the sources. 

4.4. Luminosity-variability anticorrelation 

It is known that the variability amplitude of 
AGNs is anti-correlated with luminosity, in the 
sense that brighter sources exhibit smaller vari- 
able flux fractions than fainter ones (e.g. Barr & 
Mushotzky 1986; Lawrence & Papadakis 1993; 
Nandra et al. 1997). In Figure 12 we plot the lumi- 
nosity of the CDFS sources with available redshifts 
versus their excess variance. The luminosities are 
calculated in the 0.5-7 keV band and k-corrected 
assuming a power law spectrum with T = 1.8. In 
this figure we see little correlation between the two 
quantities, as expected given the large errors both 
on the excess variance, due to the small statistics, 
and on the luminosities, reflecting the uncertain- 
ties in the photometric redshifts. To enhance any 
underlying Lx -variability correlation we divided 
the luminosity range in logarithmic bins and cal- 
culated the biweight mean (which is less sensitive 
to outliers than the regular mean) of the excess 
variance in each of them. We can sec that in the 
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10 < Lx < 10 range the average values show 
anticorrelation of the variable flux fraction with lu- 
minosity. However the brightest sources, at high 
redshifts, have large excess variance, inconsistent 
with extrapolating the trend of lower z sources to 
high luminosities. It may be argued that the use 
of a fixed energy band does not sample the same 
spectral range for sources at different z. In prin- 
ciple we do not expect that the use of different 
energy bands significantly affects the conclusions 
since the majority of our sources do not show de- 
pendence of variability on the energy range (see § 
4.3.2). However, to further test this, we extracted 
luminosities and variabilities in the 2-7 keV rest- 
frame band finding that, while the scatter of the 
sources is increased due to the smaller statistics 
(we are using less than half of the detected pho- 
tons), the high z sources still have systematically 
higher variability inconsistent with low z sources. 

Using the maximum excess variance u max (Fig- 
ure 13) the correlation for sources in the redshift 
range 0.5 < z < 1.3 is more evident. Note that 
this is the redshift range where we find the ma- 
jority of our sources and where we are able to 
sample a large range of luminosities (Figure 14). 
This is due to the smaller uncertainties associated 
with a m ax and, to a lesser extent, to the fact that 
the use of a max tends to reduce the scatter due 
to the random sampling of our sources. A least 
squares bisector fit (allowing for uncertainties in 
both coordinates, Isobe et al. 1990) yields a power- 
law slope of 1.17 ± 0.14 (-1.31 ± 0.23 using 

a rms instead of a max ). Again high redshift sources 
are significantly more variable, while sources at 
z < 0.5 seem to show the opposite trend, ly- 
ing preferentially in the left region of the plot. 
We point out that no correction for time dilation 
has been applied to the data since it would re- 
quire to make a somewhat arbitrary assumption 
about the Power Density Spectrum (PDS) of our 
sources (which we cannot reliably measure with 
the present data). However note that assuming 
a power-law PDS with a slope ~ 1.5, as seen in 
local AGNs, would further enhance the variability 
of high redshift sources. 

4.5. Dependence of variable fractions on 
redshift 

In Figure 14 we plot the luminosity of CDFS 
sources as a function of photometric redshift. To 



study the dependence of the fraction of variable 
sources N var /NTot on redshift, we measued the 
running average value of N var /NTot using a bin 
width 5 Az = 2 (Figure 14, inner panel), lim- 
iting the analysis to objects with > 100 counts 
and Lx > 10 42 erg s _1 . The large bin width 
minimizes uncertainties due to variations in the 
source flux. The median source counts (~ 300) 
are constant with redshift within <~ 15% and thus 
their distribution does not affect significantly our 
variable/non- variable classification (see Figure 4). 

The plot shows that the fraction of variable 
sources decreases from z = to z <~ 2 and then 
tends to increase again at higher redshifts. We find 
that this trend is significant at the <~ 2.5er level. 
We checked for energy band and timescale selec- 
tion effects. Using the rest-frame 2-7 keV band 
we confirmed the decreasing trend at the 2a level 
(note that the lower significance level is mainly 
due to the smaller number of sources in the sam- 
ple with > 100 counts). Time dilation produces 
a difference in the rest-frame timescales between 
the medium and high redshift bins of a factor ~ 3 
which, according to Figure 10 would affect the 
variable fractions to < 10%. 

We further note that the value of the N var /NTot 
ratio is scarcely affected by which luminosity range 
is sampled at any given redshifts because, while 
more luminous sources are less variable based on 
the Lx-variability relation, they also have larger 
fluxes making variability more easily detectable. 
Figure 15 shows that these two effects closely bal- 
ance each other, since the average variability de- 
tection limit for sources at a given redshift has the 
same slope as the L x — cr^ ms correlation. On the 
other hand if all AGN obey the same (in terms of 
slope and normalization) Lx — Crms correlation of 
local AGNs, we wouldn't expect to detect any vari- 
able source at high redshift where we sample only 
the brightest objects. The fact that we do detect 
variability in distant AGNs is due to the increased 
a rms °f t nese sources which balances the higher 
variability detection limit (Figure 15). Thus, while 
an increase N var / 'Nxot ratio at high redshift is ex- 
plained by the the high-luminosity sources which 
are more variable than their low-z counterparts, 

5 At the edges of the interval covered by our data the bin size 
is truncated on one side, thus reaching an effective width 
of Az = 1 for z = and z = 3.5 
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the decreasing trend for z < 2 is harder to ex- 
plain. We discuss some tentative interpretations 
of this result in § 5.4. 

5. Discussion 

5.1. Prevalence of variability 

The analysis of the X-ray sources detected in 
the CDFS reveals that (§4.1) 20% of the sample 
is variable in the 0.5-7 keV band. However, ex- 
cluding sources with too few counts to measure 
variability (< 100 counts), this fraction becomes 
45% and increases with total flux up to 94% at 800 
counts, implying that we detect variability when- 
ever we have enough photon statistics to measure 
it. This is consistent with the analysis performed 
by Bauer et al. (2002, 2003) in the Chandra Deep 
Field North, where the authors detect a fraction 
of variable sources as high as 90% when adequate 
statistics is available. 

This result suggests that the vast majority 
of AGNs possess X-ray variability on timescales 
ranging from days to months. However we have 
also shown (§ 4.2) that the flux of the sources is 
correlated with their hardness ratio, in the sense 
that the brightest sources tend to be softer than 
the fainter ones. We studied the dependence of 
the variable fraction on counts for the hard and 
soft subsamples separately and we found the same 
increase in the variable fraction with counts, even- 
tually reaching 100%, even though the significance 
of the results is lower than for the whole sample 
due to the smaller number of sources. We con- 
clude that that both hard and soft populations 
are highly variable, even though in harder sources 
our ability to detect flux variations is reduced due 
to a combination of smaller intrinsic fluctuations 
and a lower average flux. 

As discussed in § 4.3, our sources are very 
heterogeneous in terms of variability: most 
lightcurves are characterized by random fluctu- 
ations, some also show well defined trends with 
flux slowly increasing or decreasing over several 
months. This complex behavior is represented 
in the histograms in Figure 10 which show vari- 
ability at all timescales from a fraction of a day, 
up to a year. Long term variability dominates 
our sample: detecting variability on timescales 
longer than a month is twice as likely than on 
timescales smaller than 10 days. This is in quali- 



tative agreement with the power-law shape of the 
PDS characterizing the variability of local AGNs, 
which implies larger variability amplitudes on long 
timescales and thus a larger chance of detecting 
flux variations. On the other hand we detect short- 
term variability (< 2 days) in the majority of our 
sources, meaning that part of the flux variations 
are produced on spatial scales < 2 x 10~ 3 pc, cor- 
responding to the inner part of the accretion disk 
and and of the Broad Emission Line region. This 
suggests that whatever process is responsible for 
the variability must account for a wide range of 
timescales or that several different processes are 
contributing at the same time. 

We do not find any significant (Ar > 0.2) spec- 
tral variability in 70% of the sample (§ 4.3.2). 
In half of the remaining sources the HR appears 
anti-correlated with the flux, similar to the high- 
soft/low-hard behavior of galactic sources (van 
der Klis 1995). This lack of correlation between 
variability and spectrum in most sources argues 
against variable obscuration as the main mecha- 
nism responsible for X-ray variability, even though 
small column density changes would not be de- 
tectable in the present data. The presence of 
sources in which spectral variations are indeed ob- 
served but appear uncorrelated with flux, indi- 
cates that in these objects the spectral changes are 
not triggered by X-ray luminosity fluctuations. 

5.2. Variability dependence on absorption 

In § 4.2 we showed that the variable and non- 
variable sources represent two distinct populations 
in terms of their absorption. The observation that 
variable sources are significantly softer, on aver- 
age, than non- variable ones can be explained by in- 
voking different column densities of absorbing ma- 
terial, which affects mostly photons below 2 keV, 
making sources both fainter and harder (Figure 5 
and 6). This separation between variable and non- 
variable sources according to intrinsic absorption 
mimics closely the one between Type I and Type 
II AGNs shown in Figure 9 of Tozzi et al. (2001). 
Even though it is difficult to separate the actual 
dependence of variability on absorption from the 
spurious effect introduced by the lower statistics 
in fainter sources (which makes variability harder 
to detect) our tests suggest, at the 2a confidence 
level, that there is an intrinsic difference in the 
variability of the absorbed and unabsorbed popula- 
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tions. 

Previous works also found a correlation between 
the spectral properties of the sources and their 
variability, in the sense that the more variable 
sources are those with steeper spectra (e.g. Green, 
McHardy & Lehto 1993; Koenig, Staubert & 
Wilms 1997; Turner et al. 1999; Grupc, Thomas & 
Beuermann 2001). The origin of such correlation 
is still far from clear: both parameters (variability 
and spectral shape) may be related to BH mass 
and accretion rate, which would induce different 
degrees of variability while, at the same time, cre- 
ating different conditions in the accretion disk and 
its corona (optical depth, temperature) explaining 
the different spectra (see for instance Turner et al. 
1999). Reprocessing of the original radiation has 
also been invoked (e.g. Green, McHardy & Lehto 
1993; Koenig, Staubert & Wilms 1997), since 
high energy photons would be produced through a 
larger number of collision which also tend to smear 
out the variability. 

We note that these models have been invoked to 
account for observations of short timescales vari- 
ability and moderate spectral changes (Ar < 1) 
in samples of Scyfert 1 galaxies, i.e. where we are 
observing more or less directly the central engine. 
Our data, instead, span a large range in HR, im- 
plying variations in the photon index up to <~ 2 
and large column densities. The hardest sources 
in our sample are in fact consistent with column 
densities found in type II AGNs. An alternative 
explanation of the correlation between variability 
and absorption in the CDFS sources is that the X- 
ray observed emission is due to two main compo- 
nents: a highly variable one which originates near 
the central engine and a reflected one, produced 
by material distributed over a large region which 
dilutes the intrinsic variability of the primary com- 
ponent. When the central engine is observed di- 
rectly, i.e. through low column densities, the first 
component dominates the total flux and thus the 
variability. In high Njj sources, instead, the pri- 
mary component is highly absorbed. Then, if the 
reflecting material is distributed at large distances 
from the central BH or in a geometrical configu- 
ration such that it is relatively unaffected by the 
obscuring material (e.g. the molecular torus in the 
classic unification scheme), the scattered compo- 
nent will represent an increasingly larger fraction 
of the observed emission thus accounting for the 



lower variability. This scenario is supported by the 
recent detection of a soft component in the XMM 
spectra of several distant type-2 QSO in the CDFS 
field (Strcblyanska et al. 2003), which is well fitted 
by a scattering model. 

5.3. Dependence of the variability ampli- 
tude on X-ray luminosity and redshift 

An anti-correlation between luminosity and 
variability of AGNs in the X-ray band was origi- 
nally suggested by Barr & Mushotzky (1986) and 
later confirmed by several authors (e.g. Lawrence 
& Papadakis 1993; Green, McHardy & Lehto 
1993; Nandra et al. 1997). Other works, however, 
pointed out that the luminosity-variability corre- 
lation is affected by significant scatter and may 
depend on the AGN properties (line width, spec- 
tral shape etc.; Nandra et al. 1997; Turner et al. 
1999; Fiore et al. 1998). 

Given these premises, it is not surprising that 
in our data it is difficult to clearly identify a corre- 
lation between these two quantities, since we lack 
detailed information on the properties of CDFS 
sources and are thus likely to be including many 
different classes of objects in our sample. The 
problem is further enhanced by the low statis- 
tics and by the sparse sampling of the lightcurves. 
Nevertheless we have shown in § 4.4 that our 
observations are consistent with the presence of 
such anti-correlation, particularly when we use the 
maximum excess variance which is less affected by 
statistical errors and partly removes the effect of 
the random sampling. 

A more quantitative comparison with previ- 
ous works is difficult because the large uncertain- 
ties in our data do not allow to put precise con- 
straints on the slope of the Lx— variability rela- 
tion. Restricting our analysis to sources in the 
range 0.5 < z < 1.3 we obtain a power-law slope 
of -1.31 ± 0.23 (or -1.17 ± 0.14 using cr^J, 
steeper than the average slope ~ —0.6 usually 
found in literature for local AGNs (see for instance 
Lawrence & Papadakis 1993; Green, McHardy & 
Lehto 1993; Nandra et al. 1997). Note however 
that these studies sampled nearby (z < 0.1) AGNs 
on short timescales (less than one day). At higher 
redshift the situation is less clear: Almaini et al. 
(2000) found a slope of —1.5 ±0.2 analysing a deep 
ROSAT sample of AGNs at z < 0.5 on timescales 
of days to weeks. On the other hand the enlarged 
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ROSAT sample studied by Manners, Almaini & 
Lawrence (2002) yields a value of 0.54 ± 0.10, 
closer to the local one. The heterogeneous compo- 
sition of moderate and high redshift samples, for 
which accurate spectroscopic classification are not 
available, may explain the steeper slope: Turner et 
al. (1999), for instance, show that the inclusion of 
narrow line Seyfert galaxies in their sample intro- 
duces large scatter, possibly increasing the slope. 
A better selected sample is required to properly 
address this issue. 

A more interesting result is our finding that 
high redshift sources behave differently, in terms 
of variability, than their low redshift counterparts 
(§ 4.4). Sources with z > 1.3 have large ex- 
cess variances which are inconsistent with those 
expected from intermediate- z (0.5 < z < 1.3) 
sources of comparable luminosities. This agrees 
with Almaini et al. (2000) and Manners, Almaini 
& Lawrence (2002), which found that their most 
distant and luminous sources do not obey the 
same correlation of nearby ones, being more vari- 
able. Both redshift evolution and luminosity can 
be invoked as the parameters producing the "up- 
turn" at high L Xl given the degeneracy between 
the two quantities in flux limited samples. Un- 
fortunately the luminosity ranges sampled by in- 
termediate and high z sources in our data (as 
in those of Almaini and Manners) scarcely over- 
lap, and thus do not permit determining if the 
Lx— variability relation for intermediate- z sources 
extends to higher luminosities. Nevertheless the 
faintest source {Lx — 10 43 ) in our high-z subsam- 
ple is highly variable {a 2 ms ~ 1, see Figure 13), 
suggesting that a correlation between Lx and a 2 , 
with similar slope to the one found for intermedi- 
ate objects, exists in distant sources. Furthermore 
sources with z < 0.5 tend to lie on the lower-left 
side of the diagram: unless we assume that there is 
also a "downturn" of the —variability relation 
at the lowest luminosities, this supports the idea 
that evolution shifts the whole correlation toward 
lower variabilities and/or luminosities. Compar- 
ing our data with Almaini et al. (2000) and Man- 
ners, Almaini & Lawrence (2002) we also note 
that their "upturn" occurs at much higher lumi- 
nosities than in our data. Thus the interpretation 
that it may reflect a maximum in the black hole 
size distribution (Almaini et al. 2000) does not 
hold for our sample. The most likely scenario is 



that the same anti-correlation is present up to at 
least z ~ 3 but that it evolves with redshit, in 
the sense that objects with comparable luminosi- 
ties were more variable in the past. The "upturn" 
is produced by selection effects which prevent us 
from observing low luminosity AGNs at high red- 
shift. 

Possible interpretations of this result have been 
explored by Manners and collaborators (2002). 
They include an intrinsic change in AGN lumi- 
nosity triggered by higher accretion efficiency or 
a change in variability either due to a smaller 
X-ray emitting region or to fainter X-ray flares. 
Our data further support these interpretations. In 
fact we can rule out the possibility that the ob- 
served increase in variability at high z is due to 
bandpass effects, because we are using a broader 
spectral band than the ROSAT one used by Man- 
ners, Almaini & Lawrence (2002) and because 
we detected the same trend using a fixed rest- 
frame band (§ 4.4). The possibility that larger 
variability is due to obscuration provoked by inter- 
vening obscuring material across the line-of-sight 
seems also unlikely, since no correlation between 
flux and hardness ratio is detected in the majority 
of our sources (§ 4.3.2). Finally it is possible that 
the emergence of a population of highly variable 
AGN at high redshift, such as narrow- line Seyfert 
1, may increase the variability in the high redshift 
subsample. However the preliminary inspection of 
the optical spectra of CDFS sources, obtained by 
Szokoly et al. (2004, in preparation), suggests that 
our most distant X-ray sources include both nar- 
row and broad emission line AGNs whose relative 
abundance does not differ from the one found at 
lower redshifts. 

5.4. Variable fraction evolution 

In § 4.5 we showed that the fraction of variable 
sources decreases with redshift for 0.5 < z < 2. 
While the significance of the result (< 3c) does not 
allow to exclude that this decrease is due to sta- 
tistical uncertainties, we argued that it is not due 
to systematic effects due to either flux, timescale 
or bandpass selection biases. On the other hand 
luminosity selection effects may produce the ob- 
served trend since distant bright sources are less 
variable than their faint nearby counterparts. This 
interpretation however requires that all sources 
z < 2 follow approximately the same anticorre- 
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lation, and any significant increase in variability 
occurs only at larger redshifts. This possibility 
cannot be excluded based on the Lx — of ms cor ~ 
relation shown in Figure 12 and would be consis- 
tent with Manners, Almaini & Lawrence (2002), 
who also found larger variability amplitude only 
for z > 2. 

Alternatively we note that Gilli et al. (2003) 
detected the presence of a cosmic structure at 
z ~ 0.7 in the CDFS. This is also the redshift 
where most of our variable sources are located. If 
environmental properties trigger AGN variability, 
it is possible that the large variable fraction at low 
z is due to the presence of such a structure. Fi- 
nally, if variability is correlated with intrinsic ab- 
sorption and the variable/non- variable separation 
reflects the Type I/Type II classification, as dis- 
cussed in § 5.2, the decrease in variable fraction 
may reflect the evolution of the Type I/II ratio. 
In fact AGN population synthesis models trying 
to explain the observed X-ray background, pre- 
dict a decrease of the Type I/II ratio by a factor 
> 2 between z = and z ~ 1.3 (Gilli et al. 2001). 

Future work using spectroscopic redshifts and 
an extended sample to minimize the statistical un- 
certainties, possibly including the Chandra Deep 
Field North, may be decisive in better constrain- 
ing the evolution of AGN variability at high red- 
shift. 

6. Conclusions 

We exploited the deep (1 Ms) and high- 
resolution data of the Chandra Deep Field South 
to study the temporal variability of AGNs on 
timescales ranging from a fraction of a day up 
to one year. While similar studies have been per- 
formed on samples of nearby AGNs, our sample 
primarily contains moderate luminosity AGNs out 
to z ~ 3.5. 

We detect significant variability in > 50% of 
the AGN population in the 0.5-7 keV band and 
suggest that as many as 90% of all AGN in our 
sample may be variable. The variability exists on 
all timescales that we can observe, but our sources 
are twice as variable on timescales of > 100 days 
than within a few (< 10) days. All sources show 
some variability on the shortest timescales indicat- 
ing that part of the flux variations are produced on 
spatial scales < 2 x 10 -3 pc, corresponding to the 



inner part of the accretion disk and of the Broad 
Emission Line region. 

We also find that unabsorbed sources (log Nh < 
22.5 cm -2 ) appear intrinsically more variable than 
absorbed ones at the 2a significance level, suggest- 
ing that the lack of variability in hard, absorbed 
sources is due to an increased contribution of re- 
flected X-rays to the total flux. On the other hand 
we do not find strong (AT > 0.2) spectral changes 
associated with the flux variations in the majority 
of our sources. 

Our data are consistent with an anti-correlation 
between X-ray luminosity and variability, similar 
to that found in local AGNs. However, this re- 
lation seems to evolve with redshift, with X-ray 
sources having been more variable in the past. 
This trend may reflect an evolution in the accre- 
tion rate and/or size of the X-ray emitting region 
with look-back time. 

Finally wc find a possible decrease in the frac- 
tion of variable sources with redshift for 0.5 < z < 
2. This result could be explained by assuming that 
AGN variability has not evolved since z <~ 2. Al- 
ternatively, environmental effects triggering X-ray 
variability or an evolution in the Type I/Type II 
AGN ratio could produce the observed trend. Fu- 
ture work using spectrosopic redshift and an en- 
larged sample may be able to address this issue. 
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Appendix 



A. X-ray sources simulations 

To simulate the lightcurve of a source in the CDFS, in the assumption that the X-ray flux remains constant 
in time, we used the following procedure: 

1. we took the average net source count-rate, measured from the stacked observation as described in § 
3.1, and rescaled it to the actual exposure (including the exposure time and the position-dependent 
sensitivity correction) of the real source in each individual observation, obtaining the average expected 
counts n s within the source region for a non-variable source; 

2. the average background counts expected within the same region were derived in a similar way, using 
the average background count-rate fib measured in the background region, rescaled to the source to 
background area ratio A s /Ab] 

3. the simulated counts within the source region are then given by C to t — C s + C&, where the contribution 
of the source and the background, C s and Cb, were randomly extracted from a Poisson distribution 
with mean value n s and (fib * A s /Ab) respectively; 

4. the counts expected from the background region were accordingly sampled from a Poissonian distribu- 
tion with mean value rib. 

This simulated observation of the source and the background for each exposure includes the dependence on 
the position of the source on the detector, the actual exposure time and obeys the correct statistics. 

B. Sources of uncertainty in the variability estimates 

QE degradation: it is known that the Chandra ACIS quantum efficiency (QE) has undergone a continuous 
degradation since launch in 1999, due to molecular contamination building up on the cold optical blocking 
filter, and/or the CCD chips. This degradation is most severe at low energies below 1 keV, producing a 
decrease in the QE at 670 eV of about 10% per year 6 . This effect may introduce spurious variability in our 
sources, since the observations span a large time interval. To estimate the influence of the QE degradations 
we used the ACISABS absorption model of the Sherpa fitting software, included in the CIAO 2.3 package. 
We simulated the spectrum of an average CDFS source, composed of a power law with photon index T = 1.44 
absorbed by the galactic column density Nh = 8 x 10~ 19 cm~ 2 (Tozzi et al. 2001). We then applied to this 
model the additional absorption estimated by the ACISABS model for each individual observation, and we 
calculated the flux ratio between each observation and the last one (ObsID 2239). This factor estimates the 
decrease of the observed flux at each observing epoch and was used to normalize the net source counts to the 
last observation. Not surprisingly, the correction was found to be significant only for the first 4 exposures, 
since the following ones were acquired very close to each other (i.e. within 13 days, see Table 1). The 
correction factors were estimated to be 6% for ObsID 1431-0 and 1431-1, and 2% for 441 and 582 in the 
0.5-7 keV band, and slightly larger in the soft (0.5-2) keV band: respectively 9% (3%) in the first (second) 
two observations. The hard (2-7keV) band is unaffected by this problem, yielding corrections smaller than 
1%. 

Exposure correction: the actual number of counts measured by Chandra for any CDFS source in a single 
observation depends on the source position on the detector due to the decrease of the mirror's effective area 
as a function of off-axis angle and QE variations of the ACIS CCDs. This difference is taken into account 
in our analysis through the exposure correction, normalizing the observed count rates to those expected at 

'More details at http://cxc.harvard.edu/cal/Acis/Cal_prods/qeDeg/ 
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the aimpoint (§ 3.1). The exposure maps themselves however, depend on the spectral energy distribution 
(SED) of the source. Since for most sources there are not enough counts to obtain a reliable SED, we used 
the average hard exposure maps centered at 4.5 keV from Giacconi et al. (2002). If a source has a very soft 
spectrum however, this may introduce spurious variations in the exposure-corrected counts. We estimated 
the influence of the exposure correction comparing the hard exposure maps to those produced for the soft 
band and centered at 1.5 keV. We found that the maximum difference between soft and hard exposure maps 
is < 8% except in ObsID 1431-0 and 1431-1 where it is <~ 10%. The CDFS observations have approximately 
the same aimpoint so that the source position on the detector changes mostly along the azimuthal direction, 
with small variations in the distance from the aimpoint. Since the exposure maps are largely azimuthally 
symmetric as well, these uncertainties due to the exposure corrections must be considered upper limits, with 
most sources affected to a much lesser extent. 

PSF variations: another concern in studying temporal variability is the influence of using a fixed extraction 
radius, while the PSF size varies between observations due to the different source position on the detector. 
The influence of a varying PSF, however, is minimized by the fact that the main parameter affecting the PSF 
size is the radial distance from the aimpoint 7 . Since the CDFS observations have only small offsets (< 20") 
the changes in the PSF size are minimal. We quantified this uncertainty factor by extracting encircled energy 
profiles for a few bright off-axis sources in our field: we found that the use of the average 95% encircled 
energy radius (§ 3.1) produces count uncertainties smaller than 2% between different exposures. 

Sources on edges: we noted in § 3.1 that, in order to maximize the number of bins for each source, we 
retained in our lightcurves observations where the source falls in part outside the FOV. The criteria for 
inclusion were chosen to minimize the effect of the missing area on the count extraction. The requirement to 
have at least 50% of the original background area allows a reliable background estimate albeit with reduced 
accuracy; this is properly taken into account in the error estimate. When the source region falls partly outside 
the FOV instead, we estimated that the 10% upper limit on the missing area yields an underestimate of the 
source counts < 2%, since the affected region lies on the PSF wings. 
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Table 1 
Chandra observations OF CDFS 



Obs. ID Obs. Date Exp. Time (ks) a Roll Angle Aim Point 



1431-0 


1999 


Oct 15 


24.983 


47.28 


3 


32 


29.4 


-27 48 


21.8 


1431-1 


1999 


Nov 23 


92.807 


47.28 


3 


.32 


29.4 


-27 48 


21.8 


441 


2000 


May 27 


55.727 


166.73 


3 


32 


26.8 


-27 48 


17.4 


582 


2000 


Jun 03 


129.869 


162.93 


3 


32 


26.8 


-27 48 


16.4 


2406 


2000 


Dec 10 


29.564 


332.18 


3 


32 


28.4 


-27 48 


39.3 


2405 


2000 


Dec 11 


59.363 


331.81 


3 


32 


29.0 


-27 48 


46.4 


2312 


2000 


Dec 13 


123.212 


329.92 


3 


32 


28.4 


-27 48 


39.8 


1672 


2000 


Dec 16 


94.564 


326.90 


3 


32 


28.9 


-27 48 


47.5 


2409 


2000 


Dec 19 


68.719 


319.21 


3 


32 


28.2 


-27 48 


41.8 


2313 


2000 


Dec 21 


129.937 


319.21 


3 


32 


28.2 


-27 48 


41.9 


2239 


2000 


Dec 23 


130.250 


319.21 


3 


32 


28.2 


-27 48 


41.8 



a cffcctive exposure time after cleaning bad aspect interval and high background inter- 
vals (about 8800 s are lost due to high background) 



0.0014 ; 
0.0012 ; 
0.0010 ■ 
0.0008 ; 
0.0006 ; 
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Tot. net counts: 642 
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Days Days Days 



Fig. 1. — A typical lightcurve of a variable CDFS source with <~ 600 counts. Each bin represent an individual 
Chandra observation. The dotted line represents the average count rate, while the shaded region shows the 
error on the average value. The gaps between panels represent extended periods without observations. 
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Fig. 2. — x 2 distribution of the CDFS source lightcurves (continuous line) compared to those derived from 
simulations (dashed line) and to the one expected in the case of a normal distribution (dotted line). The 
distributions are normalized to the total number of sources. Note that even though the plot is truncated at 
X 2 = 50, there are several sources with larger x 2 . 
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Fig. 3. — Excess variance vs total net counts. Empty (filled) circles represent non-variable (variable) sources. 
The continuous line represents the 95% upper limit on the excess variance due to statistical uncertainties 
for a constant source, as derived from simulations. 
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Total net counts 



Fig. 4. — Fraction of variable sources vs net counts. The arrows show increase in the fraction of sources 
which are found to be variable in the 0.5-7 keV band, selecting increasingly higher S/N subsamplcs (i.e. 
sources with total counts> Net counts). The upper curve is slightly shifted for visualization purposes. 
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Fig. 5. — Hardness ratio vs net counts. The filled and empty circles represent variable and non- variable 
sources respectively. Each dotted line represent the track followed by a source at a given redshift (z = 
0,0.2,0.5,1.0,1.5,3,5) as the intrinsic absorption increases (logiVp = 19.9,22,22.5,23 cm~ 2 ). The tracks 
are normalized to fx — 10~ 14 erg s _1 cm~ 2 (1000 counts) at z = 0; fainter normalization would shift the 
tracks to the left. 
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Fig. 6. — Hardness ratio vs redshift for variable (filled circles) and non-variable (empty circles) sources. The 
dashed lines show the HR expected for a source with T = 1.8 and increasing intrinsic absorbing column 
(logNn — 22,22.5,23,23.7 cm -2 ). The lower level (horizontal line) corresponds to galactic absorption of 
N H = 8 x 10 19 cm -2 . 
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Fig. 7. — Cumulative distribution of absorbed (log Nh > 22.5 cm -2 ) and unabsorbed (log Nh < 22.5 cm -2 ) 
sources vs excess variance, showing the probability that the two subsamples are extracted from the same 
population according to a KS test. 
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Fig. 8. — Distribution of the excess variance (o^ m8 ) and max. excess variance (a^J in the CDFS sources. 
The median flux variations (median a rnis and a-max) are also shown. 
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Fig. 9. — Count rate vs hardness ratio for CDFS sources. The three panels show examples of sources with no 
spectral variability (left), spectral variability but no correlation with flux (center) and negative correlation 
with flux (right). Each dot represent a single exposure while the square shows the mean value obtained from 
the stacked observation. The best-fit line is also shown. 
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Fig. 10. — Left panel: Distribution of variability timescales in the observer rest-frame (0.5-7 keV band). 
The bin size is variable and the histogram has been normalized by the number of times each timescale has 
been sampled in the CDFS observations. The gaps are due to the irregular sampling of the CDFS field. 
Right panel: timescale distribution in the source rest-frame for the subset of objects with available redshift. 
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Fig. 11. — Distribution of the minimum restframe (right panel) timescales in the 0.5-7 keV band. The 
cumulative fraction is shown in the inner panel. 
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Fig. 12. — Lx vs excess variance. Filled (empty) circles represent variable (non-variable) sources and their 
size is coded according to rcdshift. Horizontal error bars reflect uncertainties on both the source flux and 
the photometric rcdshift. The dashed line shows the biweight mean of the excess variance calculated in 
logarithmic luminosity bins. Sources with negative excess variance are plotted at u rms = 1CP 3 . 
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Fig. 13. — Lx vs max. excess variance. Symbols have the same meaning as in Figure 12. The dashed line 
shows the best fit to sources with 0.5 < z < 1.3. 
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Fig. 14. — Luminosity vs redshift. Variable and non-variable sources are represented respectively by filled 
and empty circles. The horizontal dotted line marks the L x = 10 42 erg s -1 limit. The curves show the 
minimum CDFS detection limit (continuous line), the average limit (dashed line) and the 100 counts limit 
(dot-dashed line). Inner panel: Fraction of variable sources vs redshift. The fraction is calculated using a 
running average with a bin size of Az = 2 except at the edges of the data range where Az is truncated. The 
shaded region shows the statistical uncertainty; it does not include the dependence on the average source 
flux, which is found to be negligible. 
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Fig. 15. — Lx vs excess variance. Symbols have the same meaning of Fig. 12. The dashed line represents 
the best fit to sources with 0.5 < z < 1.3. The continuous lines show the average variability detection limit 
from Figure 3 for sources at z = 0.8 and z = 2.5. At high redshift the higher variability detection limit is 
balanced by the increase in excess variance of distant sources. 
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